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We study the nonparametric calibration of exponential Levy models with infinite jump activity. 
In particular our analysis applies to self-decomposable processes whose jump density can be 
characterized by the k-function, which is typically nonsmooth at zero. On the one hand the 
estimation of the drift, of the activity measure a := fe(0+) + k(0— ) and of analogous parameters 
for the derivatives of the k-function are considered and on the other hand we estimate nonpara- 
metrically the k-function. Minimax convergence rates are derived. Since the rates depend on a, 
we construct estimators adapting to this unknown parameter. Our estimation method is based 
on spectral representations of the observed option prices and on a regularization by cutting off 
high frequencies. Finally, the procedure is applied to simulations and real data. 
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1. Introduction 

Since Merton [19] introduced his discontinuous asset price model, stock returns were 
frequently described by exponentials of Levy processes. A review of recent pricing and 
hedging results for these models is given by Tankov [26]. The calibration of the un- 
derlying model, that is in the case of Levy models the estimation of the characteristic 
triplet (cr, 7, v), from historical asset prices is mostly studied in parametric models only, 
consider the survey paper of Eberlein [10] and the references therein. Remarkable ex- 
ceptions are the nonparametric penalized least squares method by Cont and Tankov [9] 
and the spectral calibration procedure by Belomestny and Reifi [3]. Both articles con- 
centrate on models of finite jump activity. Our goal is to extend their results to infinite 
intensity models. A class which attracted much interest in financial modeling is given by 
self-decomposable Levy processes, examples are the hyperbolic model (Eberlein, Keller 
and Prause [11]) or the variance gamma model (Madan and Seneta [18], Madan, Carr 
and Chang [17]). Moreover, self-decomposable distributions are discussed in the financial 
investigation using Sato processes (Carr et al. [7], Eberlein and Madan [12]). Our results 
can be applied in this context, too. The nonparametric calibration of Levy models is 
not only relevant for stock prices, for instance, it can be used for the Libor market as 
well (see Belomestny and Schoenmakers [4]). In the context of Ornstein-Uhlenbeck pro- 
cesses, the nonparametric inference of self-decomposable Levy processes was considered 
by Jongbloed, van der Meulen and van der Vaart [14]. 

Owing to the infinite activity, the features of market prices can be reproduced even 
without a diffusion part (cf. Carr et al. [6]) and thus we study pure-jump Levy processes. 
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More precisely, we assume that the jump density satisfies 
k(x) 

u( dx) — - — p dx, where k : K — > M + has bounded variation. (K) 

M 

When k increases on (— oo,0) and decreases on (0,oo), it is called k-function and the 
processes is self-decomposable. Further examples which have property (K) are compound 
Poisson processes and limit distributions of branching processes as considered by Keller- 
Ressel and Mijatovic [16]. Using the bounded variation of k, we show that the estimation 
problem is only mildly ill-posed. While the Blumcnthal-Getoor index, which was esti- 
mated by Belomestny [1], is zero in our model, the infinite activity can be described on 
a finer scale by the parameter 

a := k(0+) + fc(O-). 

Since k is typically nonsmooth at zero, we face two estimation problems: First, to give a 
proper description of k at zero, we propose estimators for a and its analogs k^(0+) + 
£U)(0— ), with j > 1, for the derivatives of k as well as for the drift 7, which can be 
estimated similarly. We prove convergence rates for their mean squared error which turn 
out to be optimal in minimax sense up to a logarithmic factor. Second, we construct 
a nonparametric estimator of k whose mean integrated squared error converges with 
nearly optimal rates. Owing to bid-ask spreads and other market frictions, we observe 
only noisy option prices. The definition of the estimators is based on the relation between 
these prices and the characteristic function of the driving process established by Carr 
and Madan [5] and on different spectral representations of the characteristic exponent. 
Smoothing is done by cutting off all frequencies higher than a certain value depending on 
a maximal permitted parameter a. The whole estimation procedure is computationally 
efficient and achieves good results in simulations and in real data examples. All estimators 
converge with a polynomial rate, where the maximal a determines the ill-posedness of 
the problem. Assuming sub-Gaussian error distributions, we provide an estimator with 
a-adaptive rates. The main tool for this result is a concentration inequality for our 
estimator a which might be of independent interest. 

This work is organized as follows: In Section 2 we describe the setting of our estimation 
procedure and derive the necessary representations of the characteristic exponent. The 
estimators are described in Section 3, where we also determine the convergence rates. 
The construction of the a-adaptive estimator of a is contained in Section 4. In view of 
simulations and real data we discuss our theoretical results and the implementation of 
the procedure in Section 5. All proofs are given in Section 6. 

2. The model 

2.1. Self-decomposable Levy processes 

A real valued random variable X has a self- decomposable law if for any b € (0, 1) there is 
an independent random variable Zf> such that X = bX + Zi,. Since each self-decomposable 
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distribution is infinitely divisible [21, Prop. 15.5], we can define the corresponding self- 
decomposable Levy process. Self-decomposable laws can be understood as the class of 
limit distributions of converging scaled sums of independent random variables [21, Thm. 
15.3]. This characterization is of economical interest. If we understand the price of an 
asset as an aggregate of small independent influences and release from the ^pn scaling, 
which leads to diffusion models, we automatically end up in a self-decomposable price 
process. 

Sato [21, Cor. 15.11] shows that the jump measure of a self-decomposable distribution 
is always absolutely continuous with respect to the Lebesgue measure and its density can 
be characterized through (K) where k needs to be increasing on K_ and decreasing on 
R+. Note that self-decomposability does not affect the volatility a nor the drift 7 of the 
Levy process. 

Assuming a = and property (K), the process X t has finite variation and the char- 
acteristic function of Xt is given by the Levy-Khintchine representation 

pr(u) :=E[e iuXT ] =exp(r(i 7 u + J°° (e lux - 1)^ ds)) . (2.1) 

Motivated by a martingale argument, we will suppose the exponential moment condition 
E[e Xt ] = 1 for alH > 0, which yields 

= 7 + fV-l)^dz. (2.2) 

J-00 m 

In particular, we will impose f'^° QO {fi x — 1)t^t da; < 00. In this case ifT is defined on the 
strip {z £ C|Imz € [-1,0]}. 

Besides Levy processes there is another class that is closely related to self-decompos- 
ability. Assuming self-similarity, that means (Y at ) = (a H Y t ), for all a > and some 
exponent H > 0, instead of stationary increments, Y t is a Sato processes. Sato [20] showed 
that self-decomposable distributions can be characterized as the laws at unit time of these 
processes. From the self-similarity and self-decomposability follows for T > 

<Py t (u) = W uYt \ = W THuYl \ = ex P f iT H lu + r (e iux - 1) ^ ^ dx] . 

\ J-^ \X\ J 

Since our estimation procedure only depends through equation (2.1) on the distributional 
structure of the underlying process, we can apply the estimators directly to Sato processes 
using T s = 1,7 S = T H ^j and k s (-) = k(T~ H u) instead of T, 7 and k. However, we 
concentrate on Levy processes in the sequel. 

For self-decomposable distributions the parameter a captures many of its properties 
such as the smoothness of the densities of the marginal distributions [21, Thm. 28.4] and 
the tail behavior of the characteristic function. This holds even for the more general class 
of Levy processes that satisfy property (K) . Recall that k has bounded variation if and 
only if 

n 

\\k\\rv '■= SU P I \k(xi) — k(xi-i)\ : n G N, —00 < xq < ■ ■ ■ < x„ < 00 j < 00. 
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In particular, ||fc||yv < oo implies a < oo. Similarly to deconvolution problems, the 
stochastic error in our model is driven by \<Pt{u ~ z)| _1 and thus we prove the following 
lemma in the appendix. 

Lemma 2.1. Let X t have property (K) and a = and let the martingale condition 
(2.2) hold. 

i) If ||e a; fc(a;)||ii < oo and qk := sup xg ( 01 i k ( x ) +k ( x ) a < qq ^/j en ^ ere exists a 
constant C v = C V {T, max{gfc, ||e a: fc(x)||ii , ||&||tv}) > smc/i that for all u G R 
wii/i |u| > 1 we obtain the bound 

\v T {u-i)\>cM~ Ta - 

ii) Let a.R > i/ien |</?t(w — i)| > C V (T, R)\u\~ Ta holds uniformly over all \u\ > 1 
and a/Z urai/i a < a and maxjqfc, ||e :r fc(x)||ii , ||fc||:n/} < -R- 

The value gfc as defined in the lemma can be understood as the largest slop of k near 
zero. If the process is self-decomposable than < holds and the bounded variation 
norm equals a. Otherwise, we can use qk < supui^ |fc'(x)| and ||A;||ry < ll^'IU 1 ! assuming 
the derivative k' exists, is bounded on [— 1, 1] and integrable on K. If either a > or 
property (K) is violated, <pt can decay faster than any polynomial order, for example, 
consider self-decomposable processes with a = oo [21, Lem. 28.5]. Hence, the conditions 
of Lemma 2.1 are sharp. 



2.2. Asset prices and Vanilla options 

Let r > be the risk-less interest rate in the market and Sq > denote the initial value 
of the asset. In an exponential Levy model the price process is given by 

S t = S e rt+x \ 

where X t is a Levy process described by the characteristic triplet (<r, 7, v). Throughout 
these notes, we assume X t has property (K) and a — 0. On the probability space (O, J-, P) 
with pricing (or martingale) measure P the discounted process (e~ rt St) is a martingale 
with respect to its natural filtration (.Ft). This is equivalent to E[e Xt ] = 1 for all t > 
and thus, the martingale condition (2.2) holds. 

At time t — the risk neutral price of an European call option with underlying S, time 
to maturity T and strike price K is given by e _rT E[(SV — ^)+]> where A + := max{0, A}, 
and similarly e~ rT K[(K — St)+] is the price of European put. In terms of the negative 
log-forward moneyness x :— log (if/ So) — r T the prices can be expressed as 

C(x,T) = S E[(e XT -e x )+] and V(x,T) = S E[(e x — e Xr ) + ]. 

Carr and Madan [5] introduced the option function 

nM — / s a lc ( x ' T )> x >°> 
[ ' ' \ S^VfaT), x<0 
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and set the Fourier transform TO(u) := e lux O(x) dx in relation to the characteristic 
function ipx through the pricing formula 

■F0(u)= 1 T ( "7' ) ^ ueR\{0}. (2.3) 
u(u — t) 

The properties of O were studied further by Belomestny and Reifi [3, Prop. 2.1]. In 
particular they showed that the option function is contained in C 1 (K \ {0}) and decays 
exponentially under 

Assumption 1. We assume that Ci :— ¥\e 2XT \ is finite, which is equivalent to the 
moment condition E[5 t 2 ] < oo. 

Our observations are given by 

O^Oix^+S^, j = l,...,N, (2.4) 

where the noise (ej) consists of independent, centered random variables with E[e 2 ] = 1 
and supj E[e^] < oo. The noise levels 5j are assumed to be positive and known. In 
practice, the uncertainty is due to market frictions such as bid-ask spreads. 

2.3. Representation of the characteristic exponent 

Using (2.1) and (2.3), the shifted characteristic exponent is given by 

i)}(u) := ± log(l + m(l + m)JO(u)) = i \og(ip T (u - i)) (2.5) 



X 



i 1 u + 1 + I (e l ^ x -l)^-dx (2.6) 



for u € M. Note that the last line equals zero for u = because of the martingale 
condition (2.2). Throughout, we choose a distinguished logarithm, that is a version of 
the complex logarithm such that iji is continuous with ^(0) = 0. Under the assumption 
that (1 V e x )k{x) dx 1 is finite we can apply Fubini's theorem to obtain 

ip(u) = iju + 7 + / i(u~i)J 7 (sgn(x)k(x))((u-i)t)dt, (2.7) 
Jo 

where the Fourier transform ^(sgn-fc) is well-defined on {z G C|Imz G [—1,0]}. Typi- 
cally, the k and its derivatives are not continuous at zero. Moreover, if a ^ the function 
x i y sga(x)k(x) has a jump at zero in every case. Therefore, the Fourier transform de- 
creases very slowly. Let k be smooth on R\ {0} and fulfill an integrability condition which 
will be important later: 



x We denote A A B := min{A, B} and A V B := max{A, B} for A, B g M. 
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Assumption 2. Assume k G C s (R\{0}) with all derivatives having a finite right- and 
left-hand limit at zero and (1 V e x )k(x), . . . , (1 V e x )k^(x) € L 1 (E). 

To compensate those discontinuities, we add a linear combination of the functions 
hj(x) := x j e- x l [0<oo) (x),x € R, for j = NU {0}. Since hj G C^QR) for j > 1 and 
all hj are smooth on M\ {0}, we can find ctj, j — 0, . . . , s — 2, such that sgn(x)fc(x) — 
SJ=o a jhj{x) is contained in C S_2 (IR) nC s (f \ {0}). This approach yields the following 
representation. The proof is given in the supplementary article [27] . 

Proposition 2.2. Let s > 2. On Assumption 2 there exist functions D : { — 1, 1} — > C 

and p : K \ {0} — > C such that is bounded in u and it holds 

i/>(u) = J>(sgn( M )) + vyu - a lo S (M) + Y) U ~ ■ J " 3 ' + p(u), u £ 0. (2.8) 

i=i 

TTie coefficients are given by ctj — jj (k^\0+) + k^\0— )) — X)m=i — a j'-m, especially 
ao = a holds. 

Representation (2.8) allows us to estimate 7 and cxq, . . . a s _2- A plug-in approach yields 
estimators for k^'(0+)+k^(Q—),j — 0, . . . , s— 2. Since we only apply this representation 
when ip is multiplied with weight functions having roots of degree s — 1 at zero, the poles 
that appear in (2.8) do no harm. 

Proposition 2.2 covers the case s > 2. For s = 1 we conclude from (2.6), the martingale 
condition (2.2) and Assumption 2 

iP(u)=iju+ [ (e lux - l)e x ^dx = vyu + i f T( sgn(x)e x k{x)) (v) dv, (2.9) 



Hence, if) is a sum of a constant from the integration, the linear drift iju and a remainder 
of order log \u\, which follows from the decay of the Fourier transform as | 1 . One can 
even show [27, Cor. 8] that there exists no inconsistent estimator of a for s = 1. 
Therefore, we concentrate on the case s > 2 in the sequel. 

Equation (2.9) allows another useful observation. Defining the exponentially scaled 
k-function 

k e (x) :— sgn(x)e x k(x), x E R, 
we obtain by differentiation 

T l + (i«- u ^O(„) =«7 + t^M«)- (2-10) 



Using this relation, we can define an estimator of fc e . 
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3. Estimation procedure 

3.1. Definition of the estimators and weight functions 

Given the observations {(x\, Oi), • • ■ , (xat, On)}, we fit a function O to these data using 
linear B-splines 

■= + 1 [x j ,x 3 - +1 ]: J = 1, ■ ■ • ,N, 

Xj-Xj-i Xj+i-Xj 

and a function /3 with /3g(0+) — /3q(0— ) = — f to take care of the jump of O': 

N 

6(x) = f3 (x) +^20 j b j (x), xeR. 

3=1 

We choose [3q with support [xj -i,Xj ] where jo satisfies Xj -i < < Xj . Replacing 
O with O in the representations (2.5) and (2.10) of ip and i/j' , respectively, allows us to 
define their empirical versions through 

i>(u) := - log (v K ( u )(l + m(l + iu)F6(u)) 

- 1 (i - 2u)FO(u) -(u + tu 2 )F(x6(x))(u) 

W \ u ) '■— ™ = ! u € R, 

T v K{u) (l + iu(l + iu)FO{u)) 

where k is a positive function and we apply a trimming function given by 
v K (z) :C\{0}^C, 



Z, \z\ > K, 

Kz/\z\, \z\ < K 



to stabilize for large stochastic errors. A reasonable choice of n will be derived below. 
The function ip is well-defined on the interval [—U,U] on the event 

A := {lu e : 1 + iu(l + iu)T{6{uj, •))(«) ^ OVw € [-[/, [/]} € J 7 . 

For u) £ Q \ A we set ?/> arbitrarily, for instance equal to zero. The more O concentrates 
around the true function O the greater is the probability of A. Sohl [23] shows even that 
in the continuous-time Levy model with finite jump activity the identity ¥(A) — 1 holds. 

In the spirit of Belomestny and Reifi [3] we estimate the parameters 7 and ay , j — 
0, ... s — 2, as coefficients of the different powers of u in equation (2.8). Using a spectral 
cut-off value U > 0, we define 

u 
-u 
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and for < j < s — 2 

*u 



Re(ip(u))w^.( u ) dw, if j is even, 
-(7 3 
-u 

~Lm(ip(u))Wa . (u) du, otherwise. 

-u 3 

The weight functions and w^. are chosen such that they filter the coefficients of 
interest. Owing to (2.10), the nonparametric object k e can be estimated by 

t f v . = i?- 1 [(-7 - i^(«))JTV*(«/C0] (a:), x > 0, 

\j- 1 [(-7-#'H)^fe(-V^)]W ) ^<«. 

applying a one-sided kernel function Wk with bandwidth U~ x since we know that k e 
jumps only at zero. The condition on the weights are summarized in the following 

Assumption 3. We assume: 

• fulfills for all odd j £ {1, . . . , s — 2} 

cU r U 



vT 3 w~. \u) du = and I uE (±it) du = 0. 
-u Jo 



w^q satisfies for all even j £ {1, . . . , s — 2} 

rU rU 

tog(|u|)«£ (ti)du = -l, / u- j w^ Q (u)du = and / (±u) du = 0. 

• For j = 1, . . . , s — 2 £/ie weight functions w^. fulfill 2 

^M'-Ff' »-<(«) <i« = an, ^ <( ± «)d„ = 0, 

where 1 < / < s — 2 and Z zs even /or even j and odd otherwise. For even j we 
impose additionally 

u 

log(M)u£ (u) du = 0. 

-c/ 

• is o/ Sobolev smoothness Ta + 2, i.e. J(l + \u\ 2 ) T&+2 \FW k {u/U)\ 2 du < oo, 
has support supp Vl^ C (— oo, 0] and fulfills for I = 1, . . . , s — 1 

Wjfe(a;)da; = l, / x'Wj^a;) dz = and x^^Wkix) £ L\R). 



2 For a £ R let [a] denote the largest integer which is smaller than a. 
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Furthermore, we assume continuity and boundedness of the functions u i-> u~ s+1 uiq(u) 
forqe {7, a ,...,a s _ 2 }- 

The integral conditions can be provided by rescaling: Let satisfy Assumption 3 for 
q G {7, «0j • • • j «s-2} and U = 1. Since 1 = uw^u) du = f^ v uU~ 2 w} f (u/U) dit, we 
can choose (u) := U^w^fj). Similarly, a rescaling is possible for w^ Q : 

-1 = £fog(|«|)<(«)d« = ^lcgC^f/"^-)^- 1 ^ J^wK^du 

u u 
log(| M |)C/- 1 < o (-)d U . 



-u 



Therefore, we define w^ (u) := U 1 w\ {jj) and analogously w^.(u) :— W 1 w\.{jj). 
The continuity condition on in Assumption 3 is set to take advantage of the decay of 
the remainder p. In combination with the rescaling it implies 

|<(u)| < ET-VI- 1 and \w"(u)\ < = 0, . . . , a - 2. (3.2) 

Throughout, we write A < B if there is a constant C > independent of all parameters 
involved such that A < CB. In the sequel we assume that the weight functions satisfy 
Assumption 3 and the property (3.2). 

We reduce the loss of k e by truncating positive values on M_ and negative ones 
on M + . In the self-decomposable framework there are additional shape restrictions of 
the k- function which the proposed estimator k e does not take into account. The mono- 
tonicity can be generated by a rearrangement of the function. To this end let k(x) :— 
(sgn(x)e~ x k e (x) V 0)l[-c,c]{ x )i x &R, where we bounded the support with an arbitrary 
large constant C > 0. The rearranged estimator which is increasing on K_ and decreasing 
on K + is then given by 




{k{z)>y} 



dz < 



x\, x > 0, 



k*{x):={ ) J0 C (3.3) 

dz < \x\ \, x < 0. 



!'•' ■--> .»! "' - 



Chernozhukov, Fernandez- Val and Galichon [8] show that the rearrangement reduces 
weakly the error for increasing target functions on compact subsets. This result carries 
over to our estimation problem. 



3.2. Convergence rates 

To ensure a well-defined procedure, an exponential decay of O, the identity (2.9) and to 
obtain a lower bound of \(Pt{u — i)|, we consider the class Qo(R,a). Uniform convergence 
results for the parameters will be derived in the smoothness class G s (R,a). 
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Definition 3.1. Let s € N and R, a > 0. FFe define 

i) Go(R,a) as the set of all pairs V = (7, k) where k is of bounded variation and the 
corresponding Levy process X given by the triplet (0,j,k(x)/\x\) satisfies Assump- 
tion 1 with C2 < R, martingale condition (2.2) as well as 

ae[0,a] and max { sup { k ( x )+ k ^~ x )~ a ||fc e (x)|| L i , Hfc||ry} — 

x£(0,l] 

ii) Q s (R,a) as the set of all pairs V = (7, fc) G t/o(^7^) satisfying additionally As- 
sumption 2 with 

|fc w (0+) + k (l) {0-)\ <R, forl = l,...,8-l, 
||(lVe*)fc«(aO|| £ i < R, forl = 0,...,s. 

In the class Go{R, ct) Lemma 2.1 ii) provides a common lower bound of |</?t(w — *)| for 
|u| > 1. Using max lE t i-cos(a;) £ ^ ^ wg es ti ma t;g r0U ghly for u G ( — 1, 1) \ {0}: 



\<p T (u-i)\ = cxp(V /°° C0S ^ 1 e a; /l ll lfc(x/| M |)d 2 :) 



>e~ TR 



Hence, the choice 



ie-™, Id < 1, 



k(u) := K a (u) :— < ? , 
V ^ W \iC v (T,i?)|d-'" 

satisfies 



|C v (T,i?)|u|- Ta , |u|>l, 



l\ip T (u-i)\ > k(u), uel, (3.4) 

where the factor 1/3 is used for technical reasons. As discussed above, we can restrict our 
investigation to the case s > 2. Since the Levy process is only identifiable if O is known 
on the whole real line, we consider asymptotics of a growing number of observations with 

A := max (xj — Xj—i) — > and A := minfxjv, — xi) — > 00. 

j=2,...,JV 

Taking into account the numerical interpolation error and the stochastic error, we analyze 
the risk of the estimators in terms of the abstract noise level 



e :- 



Theorem 3.2. Let s > 2, R, a > and assume e~ A < A 2 and A||(5|| 2 2 < \\5\\ 2 lx . We 
choose the cut-off value U & '■— £ _2 /( 2s + 2Tq + 1 ) to obtain the uniform convergence rates 

sup E P [| 7 -7| 2 ] 1 / 2 < e 2s /( 2s+2Ta+1 ) and 
sup E 7J [|a i -a J -| 2 ] 1 /a< e 2(-i-i)/(2-+2ra+i) ) ; „ s ■>. 
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As one may expect the rates for aj,j = 0,...,s — 2, become slower as j gets closer 
to its maximal value because the profit from the smoothness of k decreases. Note that 
the cut-off for all estimators is the same. In contrast to Q s (R,a) we assume Sobolev 
conditions on k e in the class "H S (R, a) in order to apply L 2 -Fourier analysis. 

Definition 3.3. Let s E N and R, a > 0. We define H s (R,a) as the set of all pairs 
V = (7,fc) € Q {R,a) satisfying additionally k G C S (R \ {0}), E v [\X T e XT \] < R for 
corresponding Levy process X as well as 

\j\<R, and \\k^\\ L 2<R, for I = 0, . . . , s. 

In the next theorem the conditions on A and 5 are stronger than for the upper bounds 
of the parameters which is due to the necessity to estimate also the derivative of ip. 
However, the estimation of %p' does not lead to a loss in the rate. As seen in (3.1) we 
need 7 to estimate k e . 

Theorem 3.4. Let s > 1,R, a > and assume Ae~ A < A 2 as well as A(||5j|| 2 2 + 
IK^j^jOillp) S IHI/oo- Using an estimator^/ which satisfies sup-p E-p[|7 — 7I 2 ] < 00 and 
choosing the cut-off value U a '■= £~ 2 /( 2s + 2Tq + 5 ) ; W e obtain for the risk of k e the uniform 
convergence rate 

sup E v [\\k e - fc e ||i 2 ] 1/2 < e 2s /( 2s + 2Ta+5 ). 

V=(-y,k)en s (R : a) 

Remark 3.5. The convergence rates in the Theorems 3.2 and 3.4 are minimax optimal 
up to a logarithmic factor, which is shown in the supplementary article [27]. 



4. Adaptation 

The convergence rate of our estimation procedure depends on the bound a of the true 
but unknown a G R + . Therefore, we construct an a-adaptive estimator. For simplicity 
we concentrate on the estimation of a itself whereas the results can be easily extended 
to 7, ctj , j = 1, . . . , s — 2, and k e . In this section we will require the following 

Assumption 4. Let R > 0, s > 2 and a G [0, a] for some maximal a > 0. Furthermore, 
we suppose e~ A < A 2 and A||<5|| 2 2 < ||i$|| 2 oo. 

These conditions only recall the setting in which the convergence rates of our param- 
eter estimators were proven. Given a consistent preestimator a pre of a, let <So be the 
estimator using the data-driven cut-off value and the trimming parameter 

U := U &pre := e - 2 /( 2s + 2T ^ +1 ) and (4.1) 
«(«) :=«a^(«) := <^ \ n ,_ Ta ' ' 4.2 

\u\ Ta *>™, U > 1, 
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respectively, with a pre :— a pre + | log e| — 1 . If a pre is sufficiently concentrated around the 
true value, the adaptation does not lead to losses in the rate as the following proposition 
shows. Note that the condition do £ [0, a] is not restrictive since any estimator d of 
a £ [0, a] can be improved by using (0 V a) A a instead. 

Proposition 4.1. On Assumption 4 let a pre be a consistent estimator which is inde- 
pendent of the data Oj,j — 1, . . - ,N, and fulfills for e — > the inequality 

n\a pre -a\ > | logel" 1 ) < de 2 (4.3) 

with a constant d £ (0, oo). Furthermore, we suppose do £ [0, a] almost surely. Then do 
satisfies the asymptotic risk bound 

SUp E VApTe [\& - a] 2 } 1 / 2 < £ 2( S -l)/(2 S +2Ta+l) 

veg s (R,a) 

where the expectation is taken with respect to the common distribution Pp* of the 
observations 0\, . . . , On and the preestimator a pre . 

To use do on an independent sample as preestimator, we establish a concentration 
result for the proposed procedure. We require (ej) to be uniformly sub-Gaussian (see 
e.g. van de Geer [28]). That means there are constants C\,G2 £ (0, oo) such that the 
following concentration inequality holds for all t, N > and a\, . . . £ K 

t 2 



P(| E a > e i >t)<Ci exp ( - ^2-^^) ■ (4.4) 

Proposition 4.2. Additionally to Assumption 4 let (ej) be uniformly sub-Gaussian 
fulfilling (4.4). Then there is a constant c > and for all k, > there is an e ~ 
K (2s+2Ta+i)/(2s-2)^ suc ^ f na f j or all e < Eq A I the estimator a satisfies 

P(|d - a\ > k) < ((7N + l)d + 2) exp ( - c(n 2 A K i/2) e -(»-i)/(2*+2Ta+l)) . (4.5) 

Concentration (4.5) is stronger than needed in Proposition 4.1. To apply the proposed 
estimation procedure, let S pre and S be two independent samples with noise levels e pre 
and e as well as sample sizes N pre and N, respectively. Using S pre for the estimator a pre , 
we construct adaptively do on S. We suppose N pre grows at most polynomial in e pre , 
that is Npre ^ £ p re holds for some p > [cf. 27]. To satisfy (4.3), it is sufficient if there 
exists a power q > 0, which can be arbitrary small, such that e pre ~ e q owing to the 
exponential inequality (4.5). Using e 2 > An/N > l/N, we estimate 

] --P^ < £-P £ 2 ~ £ 2 -Pt 



for q < 2 1 p. Thus, relatively to all available data the necessary number of observations 
for the preestimator tends to zero. 
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a 


E[|a -a| 2 ] 1/2 


E[|Qo-a| 2 ]V2 


40 


20.7998 


23.3589 


20 


5.8362 


7.7724 


10 


1.0505 


2.4534 


4 


0.1729 


1.1158 



Table 1. Risk of estimating a with oracle (middle column) and adaptive (right column) choice of 
cut-off value U in simulated variance gamma model with v £ {0.05, 0.1, 0.2, 0.5}, a = 1.2, 8 = —0.15. 





i 


E[|,f- 9 | 2 ]i/2 


E[|g- 9 | 2 ]i/2 


7 


0.1408 


0.0065 


0.0126 


ceo 


10.0000 


1.0505 


2.4534 


"l 


-94.1667 


32.1016 


77.5311 




II II 1/2 
11911/2 


nu-q\\ 2 L 2\ 1/2 


E[||9-9lli 3 ] 1/a 




0.9556 


0.4075 


0.5602 



Table 2. Risk of estimating the parameters 7,a,ai and the k-function with oracle (middle column) 
and adaptive (right column) choice of the cut-off value U in simulated variance gamma model 

= 0.2, cr = 1.2, 6 = -0.15). 

5. Discussion and application 
5.1. Numerical example 

We apply the proposed estimation procedure to the variance gamma model. In view 
of the empirical study [17] we choose the parameters v e {0.05, 0.1, 0.2, 0.5}, a = 1.2 
and 9 = —0.15. the martingale condition (2.2) yields then 7 = - log(l — 6v — a 2 v/2). 
According to the different choices of v, we set a = 40 as maximal value of a. 

The deterministic design of the sample {x\, . . . , xn} is distributed normally with mean 
zero and variance 1/3. The observations Oj are computed from the characteristic function 
ifiT using the fast Fourier transform method [5]. The additive noise consists of normal 
centered random variables with variance ^©(x,)! 2 for some 6 > 0. 

We estimate q £ {7, ao, &i, k}. Hence, we need s > 4 [see 27, Cor. 8]. By self- 
decomposablity of the model we apply the rearranged estimator k* given by (3.3). We 
use maturity T = 0.25, interest r — 0.06, smoothness s — 6, sample size N — 100 and 
noise level S = 0.01, which generates values of e on average 0.168. The results of 1000 
Monte Carlo simulations are summarized in Tables 1 and 2. 

In order to apply the estimation procedure, we need to choose the tuning param- 
eters. Owing to the typically unknown smoothness s, let the weight functions satisfy 
Assumption 3 for some large value s max . The weights for the parameters can be chosen 
as polynomial whereas is taken as a polynomial times a smooth function with support 
[—1,0]. The trimming parameter k is included mainly for theoretical reasons and is not 
important to the implementation. The most crucial point is the choice of the cut-off value 
U. For q we implement the oracle method U — argminy >0 \q{V) — q\ and an adaptive 
estimator q based on the construction of Section 4 with sample size N pre = 25 for a pre . 
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5.2. Discussion 

Due to the nonparametric setting, our estimators converge more slowly than with ^/n 
rate as in parametric models [10, 11, 17]. Although the studied estimation problem is 
only mildly ill-posed compared with classical nonparametric regression models and thus 
the polynomial rates are faster than in nonparametric models with a > which achieve 
logarithmic rates only [3] . In order to understand the convergence rate of the estimators 
for 7 and oij better, we rewrite equation (2.10) in the distributional sense, denoting the 
Dirac distribution at zero by 8 , and differentiate representation (2.8) 

s-2 

ip'(u) =J : (i'ySo + ik e )(u) = r ^J/'./ln.,/ ' ; + p'(u), u eR\{0}. 

3=0 

Hence, ip' can be seen as Fourier transform of an s-times weakly differentiable function 
and estimating 7 from noisy observations of ip' corresponds to a nonparametric regression 
with regularity s. Since dividing by u on the right-hand side of the above equation 
corresponds to taking the derivative in the spatial domain, the estimation of ay is similar 
to the estimation of the (j + l)th derivative in a regression model. The convergence rate 
of k e is in line with the results of Belomestny and ReiB [3] for a = since their rate 
equals ours in the compound Poisson case a = 0. 

For k e the degree of ill-posedness is given by Ta + 2. This can be seen analytically by 
observing that the noise is governed by u 2 \ipt(u — , which grows with rate Ta + 2. 
From a statistical point of view a higher value of a leads to a more active Levy process 
and hence, it is harder to distinguish the small jumps of the process from the additive 
noise. The influence of the time to maturity T on the convergence rates is an interesting 
deviation from the analysis of Belomestny and Reifi [3] . The simulation shown in Table 1 
demonstrates the improvement of the estimation for small the values of a. The estimators 
7 and k e provide a complete calibration of the model. Although, estimating the k-function 
at zero is most important and thus additional information through dy are crucial. Table 2 
contains simulation results for the estimators q and q, q 6 {7, ao, ati, k e }, corresponding 
to oracle and a-adaptive cut-off values, respectively. This adaptation to a is a first step 
to a data-driven procedure and should be developed further. 

Since the estimating equation (2.9) holds for all Levy processes with finite vari- 
ation, the proposed estimator k e can be more generally understood as estimator of 
xe x i/(dx). Thus, the estimation procedure can be applied to exponential Levy models 
with Blumenthal-Getoor index larger than zero, for example tempered stable processes. 
However, the analysis of the convergence rates does not carry over to more general Levy 
processes since the polynomial decay of the ipx, which is guaranteed by property (K), 
is essential for our proofs. Moreover, if k has no bounded variation the behavior of the 
Levy density at zero needs different methods and should be studied further. For instance, 
Belomestny [1] discusses the estimation of the fractional order for regular Levy models 
of exponential type. 

Even if the practitioner prefers specific parametric models that might achieve smaller 
errors and faster rates, the nonparametric method should be used as a goodness-of-fit 
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T 0.314 0.567 

r 0.045 0.044 

Npre 20 21 

N 81 85 



7 0.101 0.344 

a 34.848 23.600 

Si 239.348 147.699 



Table 3.. Adaptive estimation based on DAX op- 
tions from 29 May 2008 with time to maturity T 

and N + N pre observations. Figure 1. Observed DAX option prices (points and 

triangles) from 29 May 2008 with fixed maturity 
T = 0.314 and different log strike prices x as well 
as the option function generated from the estimated 
model (solid line). 

test against model misspecification. To construct such tests, confidence sets need to be 
studied which is done by Sohl [24] in the framework of Levy processes with finite activity. 
Based on this asymptotic analysis, Sohl and Trabs [25] construct confidence intervals in 
the self-decomposable model. 

5.3. Real data example 

We apply our estimation method to a data set from the Deutsche Borse database Eurex 3 . 
It consists of settlement prices of put and call options on the DAX index with three and six 
months to maturity from 29 May 2008. The sample sizes are 101 and 106, respectively. 
The interest rate is chosen according to the put-call parity. The sub-sample for the 
preestimator consists of every fifth strike while the main estimation is done from the 
remaining data points. By a rule of thumb the bid-ask spread is chosen as 1% of the option 
prices. Therefore, we get noise levels e with values 0.0138 and 0.069 for the two maturities, 
respectively. Table 3 shows the result of the proposed method. As one would expect, the 
jump activity is smaller for a longer time to maturity. The estimator k(x) = e~ x k e (x) as 
well as the rearranged estimator k are presented in Figure 2. In Figure 1 the calibrated 
model is used to generate the option function in the case of three months to maturity, 
where the data points used for the preestimator are marked with triangles in the figure. 
For a comparison of the outcome of our estimation procedure with the spectral calibration 
of Bclomcstny and Reifi [3] we refer to Sohl and Trabs [25] . 

^provided through the Collaborative Research Center 649 "Economic Risk" 
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-0.5 0.0 0.5 -0.5 0.0 0,5 

X X 



Figure 2. Using ODAX data from 29 May 2008 with three (top) and six (bottom) months to maturity, 
estimation of the function k with (right) and without (left) rearrangement. 

6. Proofs 

6.1. Proof of the upper bounds 

Let us recall some results of [3]: Because of the B-spline interpolation we obtain Oi(x) := 
E[0(a;)] = ^2jLi 0(xj)bj(x) + 0o(x),x G R. Furthermore, the decomposition of the 
stochastic error if) — if) in a linearization C and a remainder TZ, 

C{u) := T- x ip T (u - - u)uF{6 - 0){u), K(u) := tJj(u) - ${u) - C(u), 

u£l, has the following properties: 

Proposition 6.1. i) Under the hypothesis e~ A < A 2 we obtain sup ugR |E[.FC)(u) — 
^©(u)]! = sup ueR \FOi{u) — J : 0{ii)\ < A 2 uniformly over all Levy triplets satis- 
fying Assumption 1. 

ii) If the function k : K — > satisfies (3.4) then for all u G K the remainder is 
bounded by \Tl{u)\ < T~ x k{u)~ 2 {u A + u 2 )\ T(Q - 0)(u)\ 2 . 
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Upper bound for 7 and ctj (Theorem 3.2): 

Since Theorem 3.2 can be proven analogously to Theorem 4.2 in [3], we only sketch the 
main steps. Note that in Q s {R,a) we can bound uniformly in representation 

(2.8) [cf. 27, Lem. 9]. Let us consider 7 first. The definition of 7 and , the decomposition 
of ip and representation (2.8) yield 



7 



i-u 

Hence, we obtain 



u 



Im(^(u))«c(tt)dit = 7+ / Jm{p(u) + C{u) + K(u))w^(u) du. 



-v 



E[|7-7l ] < 3 / p(u)w^(u)du 



U 2 
U 
1 

u 



+ 3E 



u J 
C(u)w^ (u) du 



3E 



TZ(u)w^(u) du 



where all three summands can be estimated separately. The first one is a deterministic 
error term. It can be estimated using the decay of p{u) and the weight function property 
(3.2): 



p{u)w^{u) du 



< 



[/- (s+1) |p(«)« s " 1 |dw S U~ 



A bias-variance decomposition, with the definition Var(Z) := E[|Z — E[Z]| 2 ], of the linear 
error term yields 



E 



u 2 
L{u)w^{u) du 

-u 



3 T [l ( u)u ., nno o){u)\ w v{u) du 2 

-u Tip T {u-i) 

u (i — u)u 



Var 



-u T<p T {u - i) 



F6{u)wV{u)du) =:£l+£ v 



Using the approximation result in Proposition 6.1, the bound of \(pr{u — i)\ 1 given by 
kT x and property (3.2), we infer the estimate of the bias term 



\L b \ < A 2 [/- (s+1) 



\ip T [u-i) 



- 1 \u\ s+1 du<A 2 U T&+1 . 



For the variance part we make use of the properties of the the linear spline functions bk 
as well as supp(iu^) C [-U, U] and the independence of (efc). We estimate (Cov(Y, Z) := 
E[(Y-E[Y])(Z-E[Z})}) 



r 



1 " u * (j - u)u 
-u ~"' \Tip T (u - i) 



-u 

N 



Cov 1 



FO(u), J' , V)V F6(v)) W V(u)wV(v)dudv 
1 <pt{v — 1) > 



^=1 



° r} 1 , U)U .^ b k (u) w V(u)du 2 < A\\S\\^U 2T&+1 . 
u Tip T {u-i) 
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To estimate the remaining term 1Z, we use Proposition 6.1, the property (3.2) of and 
the choice of k. In addition the independence of (e^) and the uniform bound of their 
fourth moments comes into play. 



E 



< 



u 
u u 



7Z(u)w}j (u) du 



-u -u 

: (a 4 



u A w^ (u) v 4 (v) 
< \2 ( \2 dudv 



u u 4 W U(u) 



n(u) 2 



du 



U N 4„ Uf \ 

u 



-u 



< (a 4 C/-( s+1 ) / k(u)- 2 \u\ s+3 duj + (A 2 ||5||f 2 C/-( s+l ) / K ( u )- 2 | M | s+3 d w ) 



<[/ 4Ta + 6 (A 8 + A 4 |j(5|| 4 2 ). 

Therefore, the total risk of 7 is of order 

E[|7 - 7I 2 ] £ + U 2T&+1 (A 4 U + ApHfoo) + (7 4Ta + 6 (A 8 + A 4 ||<5|| 4 2 ) 

uniformly over Q S (R, a). Since the explicit choice of U = U & — e _2 /( 2s + 2TQ + 1 ) fulfills 
U < A -1 and A||(5|| 2 2 < 11x112 



lao holds by assumption, this bound simplifies to 



E[|7"7| 2 ] <U- 



2 s 



U 



2Ta+l £ 2 



u 



4Ta+6 £ 4 



Here U & balances the trade-off between the first and the second term whereby the third 
term is asymptotically negligible. We obtain the claimed rate. 

For otj,j = 0, . . . , s — 2, the only difference to the analysis for 7 is the rescaling factor 
of vr^. in (3.2). Since its square appears in front of every term, we verify 



E\ 



2 ] < [/- 2 ( S - 1 -J) 



u 



2Ta+2j+3 



{A 4 U + A\\S\\^) + u 4Ta+2 i +s (A 8 + A 4 \\S\\ 4 2 ) 



< jj-2(s-l-j) + u 2T&+2j+3 £ 2 + u 4Ta+2j+S £ 4 



Upper bound for k e (Theorem 3.4)-' 

Similarly to the uniform bound of the bias of TO in Proposition 6.1, the following lemma 
holds true. It can be proved analogously to [3, Prop. 1] and thus we omit the details. 

Lemma 6.2. Assuming Ae~ A < A 2 , we obtain Sup ueR |E[J-"(a;(0 — 0)(x)) (u)]\ = 
sup MgR \T(x(Oi-0){x)){u)\ < A 2 uniformly over all Levy triplets satisfying Assumption 
1 andE[\X T e XT \] < 1. 



For convenience we write m := 2s — 1 and Wk '■= J~Wk such that Wk(u/U) = 
UT(W k (Ux))(u). Using ||/|| 2 2 - / \f(x)\ 2 dx+j R _ \f(x)\ 2 dx =: U/U^+H/H 2 ^) 
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for / € L 2 (R), it is sufficient to consider the loss of k e on R + . On R_ one can proceed 
analogously. We split the risk into a deterministic error, an error caused by 7 and a 
stochastic error, 

Ep[\\k e - k e \\ 2 L 2 {R+) ] 
=Ep[||^- 1 ((- 7 - #»K(^)) - MiW 



<E 7 



<3 



[/ 3 









((-7 - (x) - k e (x) 



3 



T-\{-i^'{u) + i^{u))w k Q){x) 



Ax 



^(^(uKC^^ajJ-fceCa:)! da; + 3E[| 7 - 7 | 2 ] / |t/W fe (i7x)| 



ds 



+ 3E 
:D + G + S. 



[J 




1 JM + 





, u , 



d.T 



The support of W k yields G = 0. The deterministic term D can be estimated in the 
spatial domain, where we use the local smoothness of k e . For pointwise convergence 
rates this was done in [2]. We decompose using suppWfc C (— 00, 0] 



D = 3 

< 6 



U(k e *W k (U»))(x)-k e {x) 



Ax 



(k e (x + y/U) - k e {x))W k (-y) Ay 



Ax 



+ 6 



(k e (x + y/U) - k e (x))W k (-y) Ay Ax =: &{D X + D 2 ). 



Cauchy-Schwarz's inequality, the estimate | W k (— y)\ Ay < U 171 J R \y m W k (y)\ Ay 
U~ m and Fubini's theorem yield 

B x < 
< U 

<U 1 n .1/1 / i'., 1.' — <//< 11 -r i'., i.« m u.i <i// „• < » e iiL2. 



< 



\W k (-y)\Ay \k e (x + y/U)-k e (x)\ \W k (-y)\AyAx 
Ju 

(\k e (x + y/U)\ 2 + \k e (x)\ 2 )\W k (-y)\ Ay Ax 
\W k {-y)\ ( \k e (x + y/U)\ 2 + \k e (x)\ 2 AxAy<U- 2m \\k e \\ 
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Using a Taylor expansion, we split D 2 in a polynomial part and a remainder: 



dx 



U fc^ ) (z)(x + ? //t/-z) s - 



dzVF fe (-y) dy 



da; =: 2D 2P + 2D 2R . 



We estimate by /J 7 y j W k (-y) dy = - J~ y J 'W fc (-y) dy for j = 0, . . . , s - 1 

D2P<sU- 2m Y jJ ^- 2 [ \k^(x)\ 2 dx([ \y m W k (-y)\dy) 2 <U- 2mS ^2\\k^\\ 
With twofold usage of Cauchy-Schwarz and with Fubini's theorem we obtain 



2 

L 2 ■ 



2fl 



^ ki s) (x + z)(y/U- z) s - 



< 



< 



jo 

u 



( s -i)\ 

\ki s) (x + z)\ 2 dz 



■dzW k (-y)dy 



d.r 



"-Jo 
u 

/ TT 



1/2 , f* (It, - z )2s-2 1/2 



|fc«(z + *0| 2 dz|W fc (-i,)|dy 



((«-l)0 2 

!7 C„./rn2s-l 



dzj I Wfc(-y) I dyj dx 



Jo 



(y/uy 

o (2 S -1)(( S -1)!> 



r|W fc (-i/)|dyda; 



<[/-(2s-l) 



[/ r y/(7 



JO 



\ki s \x + z)\ 2 dxdz\W k (-y)\dy 



<u-^\\k^\\h 



u 



- y \ Wk (-y)\dy<U- 2s . 
u 



Therefore, we have D + G< £/~ 2s + UE[\j - 7 | 2 ]. 

To estimate the stochastic error 5, we bound the term \ip'(u) — ip'(u)\. Let us introduce 
the notation 

ip T (u - i) := w K ( u )(l + (iu - u 2 )J"(5(u)), 

(p' T (u — i) := (i — 2u)J-(D(u) — (u + iu )T(x6(x))(u), ueR. 

For all u £ E where \(pr(u — i)\ > k(u) we obtain (pr(u — i) = 1 + (iu — u 2 )FO(u). For 
\<Pt( u — i)\ = the estimate |^r(w — *) — <^t(w — i)| > 2k(u) follows from (3.4). This 
yields 

\((>t{u — i) — (pr(u — i)\ < |1 + (iu — u 2 )FO(u) — ipr(u — i)\ + n(u) 



< |1 + (iu — u 2 )FO(u) — tpx(u — i)\ + -\(pt(u — i) — fr( u — i) 



Calibration of self-decomposable Levy models 



21 



Therefore, \(pT(u — i) — (pT(u — i)\ < 2|l + (iu — u 2 )FO(u) — (Pt(u — i)\ holds for all u 6 
We obtain a similar decomposition as [15], 



\${u)-^\u)\ 



(p' T {u — i) ip' T (u — i) 



<p T (u-i) <pr(u-i) 
~ T\{p T (u-i)\ ^' TtyU ~^ ~ ^^'^ + T W (. u )\\vt{u - i) - tp T (u-i)\) 

- 2T^y + 4m2)1/2 + 2T ^'^ u2 + u4 ) 1/2 ) 

+ (u 2 + u 4 ) l ' 2 \F(x{6 - 0)(a:))(u)|). 
Since W{u)\ < \j\ + \\k e \\ L i < 2R, we have 

W{u) - ^'{u)\ < -^((1 + u 2 )\F(0 - 0){u)\ + (u 2 + u 4 )^ 2 \T(x(d - 0)(x))(u)\ 
It follows with Plancherel's equality 



S <3E 



< 



r 1 ((^'(!i)-/(ti)) ) » t WP))| 



L 2 



3 

2^ 



¥.[W{u)-4,'{u)\ 2 ]\w k (u/U)\ 2 &u 



du 



=:Si + S 2 



Both terms can be estimated similarly. Thus, we only write it down for S2, where stronger 
conditions are needed. Lemma 6.2 and |j J r (xbj(x))\\ cx> < 2A(xj + A), j = 1, . . . , N, yield 



S 2 < 



(\\xiOi - 0)(x)||L + Var {T{x6{x)){u)))\w k {u/U)f 

N 



du 



2TS+5 



<(A 4 + A 2 ||(^-)ll^+A 4 || ( 5 J ||? 2 )f/ 2Ta+5 < £ 2 t/ 

Therefore, we have shown E[||fc e - k e \\ 2 L2 T ] < U~ 2s + e 2 U 2TSl+5 + UE[\j - 7 | 2 ]. The 
assertion follows from the asymptotic optimal choice U = U & = £~ 2 /( 2s + 2Ta + 5 ) and the 
assumption on the risk of 7. 



6.2. Proof of Proposition 4.1 

Step 1: Let (a E ) E>0 be a deterministic sequence such that there is a constant C > 
with \a e — a I < Cllogej" 1 . Let the estimator d use the cut-off value U e := U ae and 
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the trimming parameter n £ := K Be , with a e := a £ + C|loge| , as defined in (4.1) and 
(4.2). Then we can show the asymptotic risk bound sup-p eg jR, a ) ^vil&o ~ ce] 2 ] 1 ^ 2 < 
£ 2(s-i)/(2s+2Ta+i) ag f n ows: gy construction holds a < a e . Hence, n E fulfills condition 
(3.4) for each pair V <G Q s (R,a) and thus we deduce from Theorem 3.2 

[|d - a\ 2 ] < C/-2(-D + u 2Ta+3 e 2 + U^ +S e 4 

=£ 4(s-l)/(2s+2Ta E + l) + £ 4T(a e -a)/(2s+2Ta e + l) + £ (4s-8+8T(a E -a E ))/(2s+2Ta e + l) \ 

The first factor has the claimed order, since £ 4( s -i)/(2 S +2Ta e +i) < £ 4( s -i)/( 2s +2Ta+i) 
follows with easy calculations from (a — a e ) loge < C. Hence, the claim follows once we 
have bound the sum in the bracket of equation (6.1). For the second term this is implied 

by 



4T(q e -q) , 
2s+2Ta e + l iu 5 c 



< 4T|(a e -a)loge| < 4TC 
— 2s+l — 2s + l' 



To estimate the third term, we obtain from s > 2 and e < 1 

4 s -8+8T(q e -a.) , < -STCIloger 1 , < &TC 
2s+2Ta c + l iu & fc — 2s+l lu & fc — 2s+l " 

Step 2: Let V € Gs(R, ct). Note that K e satisfies the condition (3.4) on the set {|d pre — a\ < 
| loge| -1 }. Using the independence of a pre and Oj, the almost sure bound ccq < a and 
the concentration of a pre , we deduce from step 1: 

E-p,Q pre [\&0 ~ Ol\ 2 ] <M-p,&pre E -P,a p „ [|«0 - a| 2 |dpre] l{|Q p „-Q|<|loge|-i} 

+ 4a 2 P <Vc (|d pre -a| ^lloger 1 ) 

< £ 4( S -l)/(2 S+ 2Ta+l) +4 -2 d£ 2^ 

Since the second term decreases faster then the first one for e — > 0, we obtain the claimed 
rate. 



6.3. Proof of Proposition 4.2 



Recall that the cut-off value of do is given by U = £ - 2 /( 2s + 2Tq + 1 ). F r n > we obtain 
from the definition of the estimator and the decomposition of the stochastic error into 
linear part and remainder: 



p(|d -a| >k) = : 



< 



-u 
u 



Re(p + %j) — ip)(u)wa {u) du 



-u 



P(u)w^ (u) du 



> 



+ 



•(|^ ft(u)<,(u)du 



> 



> K 
■U 
-U 
= ■■ Pi + P 2 



Re(£(u))< ( M ) du 
P 3 - 



> 
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We will bound all three probabilities separately. To that end, let Cj,j € N, be suitable 
non-negative constants not depending on k, e and N. 

The event in Pi is deterministic. Hence, the same estimate on the deterministic error 
as in Theorem 3.2 



/*(«)»«„(«) du 



<c,U-^ 



2(s-l)/(2s+2Ta+l) 



yields P x = for all e < £ « := ( K /(3 Cl )) (2s+2Ta+1)/(2s - 2) . 

To bound P 2 we infer from the definition of £, the linearity of the errors in O 
Oi + YujLi 3j £ jbj an d from the estimate of the term in Theorem 3.2 



Re(£(u))wV(u)du 



u (u^ + u 2 ) 1 / 2 
_u T\<p T {u-i)\ 



U Rc i r l ( U)U -^ {6 O)(u)) w v o (u)du 
_ v \Ttp T (u-i) J 

u 



Re 



(i — u)u 



N 



u \Tip T {u-i) 



AT 



2rrTa + 2 



(i — u)u 



Re 



_ a \T(p T (u-i) 



Jfy(u) )w ao (u) du 



A r 



< C2£ 2( S -l)/(2 S +2Ta + l) + IJ^a^ 



3=1 



where the coefhcients are given by aj := 8j J U [/ Re ( t ^ t ^-i) ( u ) ) w a ( u ) ^ u f° r 3 

1 Y. To apply (4.4), we deduce from H-F&jHoo < 2A, the weight function property 

(3.2) and the assumption A||5|| 2 2 < \\S\\ 2 X 



N - ( r U (lA U_ 7/ 2^1/2 \ 2 

E° 2 < E^f ( j_ u T\^{u-i)\ I^WII<(")I dttj <c^U 2 ^\\5\ 



< Ci e 2 U 2Ta+i = C4e 2(-i)/(2«+M , a+i) < 
This implies through the concentration inequality of (ej) 



N 



Pi < 



E a J £ 

0=1 



> 



+f>( C2 e 2{s ~ 1)/(2s+2Ta+1) > - 
i / V 6 



< Cl CXp f - i^ K 2 e -2( S -l)/(2 S+ 2T a+ l)\ 

V 36c4 / 



36c4 



for all e < := (k/(6c 2 )) 



(2s+2Ta+l)/(2s-2) 
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It remains to estimate probability P$. The bound of 1Z in Proposition 6.1 ii) yields 

u 



K{u)w u a (u)du 



-u 
u 



< 



u u 4 + u 2 



<2 



4 i 2 

u + u 
Tk(u) 2 



-u Tk(u) 2 
\T(O l -O)(u)\ 2 \ W V (u)\du + 2 



\HO-0){u)\ 2 \w u aQ (u)\du 
u 

.A , ..2 , 



4 , 2 w 

ir + u \ 



Tk(w) 2 



The first addend gets small owing to Proposition 6.1 i) 

u 4 + u 



U 



Tk(u)'- 



'-\T{O l - O)(u)\ 2 \w u ao {u)\ du <||^(0, - J 



if + U , y . 



^ „,4 , „,2 
<C 5 A 4 C/ 2Ta+4 < C5£ 2( S -l)/(2 S +2Ta+l) > 



For the second one we obtain 



N N N j-l 



3=1 3=1 J=2 fe=l 



Thus, 



„[/ W N j-l 

/ R(u)<(«) du < 2c 5 £otI+t + 2^5| £ |^ i (C/) + 4^ ^ &Mi e kti,k{V) 

J ~ U 3=1 j=2fc=l 

with £j,k(U) ■= J U U tk^u) 2 ^ e (•Fbj(u)J 7 bk(— u))\w^ o (u)\ du. Denoting the diagonal term 
and the cross term as 

N N j-l 

D N : Y. S ] £ %AU) and r v : SjS^e^U), 

3=1 J=2 fe=l 



respectively, we obtain 



P 3 <P(2c 5 e 2(s - 1)/(2s+2Ta+1) > ~) +P(2^ > ~) +P( 



The first summand vanishes for e < := (K/(18c5))^ 2s+2Ta+1 ' ) ^ 2;i 2 \ To estimate the 
probabilities on and Un, we establish the bound 



|^(C/)|<||^||oo||^ fc | 



17 u 4 + u 2 
Tk(u) 2 



" + "' |<>)|d u <c 6 A 2 [/ 2Ta + 4 (6.2) 



for j,k = I, . . . ,N. Hence, 

| ^ c 6 A 2 ||£|| 2 2 ;7 2Ta+4 < c 7 e 2 U 2T&+4 < Cl£ ^-D/^+^+i) ^ 

3=1 
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which yields together with (4.4) 



P(l*>£)<P( 



JV 



k=l,...,N 



> 



18 



< 



sup |e fc | > — — e 

fe=l,...,JV -L8C7 



-2(s-l)/(2s+2Ta+l) 



< CiTVexp 



^2 .._-2(s-l)/(2s+2Ta+l) 



18c 7 



To derive an exponential inequality for the U-statistic Un, we apply the martingale idea 
in [13]. Because of the independence and the centering of the (ej), the process (Un)n>i 
is a martingale with respect to its natural filtration (J- N ) (setting U\ = 0): 



N-l 



E[U N - Un-i\T N -^ = E[ E ^4ejve fc ^,fe(f/)l^-i 



fe=i 

We apply the martingale version of the Bernstein inequality [22, Thm. VII. 3. 6] which 
yields for arbitrary t,Q,S > 

/ i 2 

P(\U N \>t)<2P((U) N >Q)+2V( max \U k - U k _ x \ > S) + 2exp ( - 



Hence, we consider the increment |t//v — t/iv-i| = ]£jv 
Denoting aN,k := SNdk^N,k(U), we estimate using (6.2) 

N-l iV-l 



J2k=l SNSk^N,k( U ) £ k 



4{Q + tS), 
(6.3) 

for N > 2. 



E = ^ E &, fe (C/) 2 < cgA^*+*d^||^ (6.4) 

k=l k=l 

< C 2 6 A 4 \\S\\t 2 U 4Ta+8 < C 2 7 e 4 U 4T&+8 < c 2 £ 4( S -l)/(2 S +2Ta+l) > 

Thus, by Assumption (4.4) we obtain for all S > 



N-l 



P(\Un - C/jv-iI > S) = F(\£n\\ a N,k£k 

fe=i 

iV-l 

<p(M > ^ e -(s-l)/(2 S +2Ta+l)^ + p (| £ 



> s 



O-N.k^k 



k=l 



> 



v / S , e (s_1)/(2s+2Ta+1) ) 



<d exp ( - C 2l 5 £ -2( S -l)/(2 S + 2 ra + l)^ + ^ exp ^ _ ^ S£ -2(s-l)/(2s + 2T &+ l)^ _ 

The quadratic variation of Un is given by 

N-l 

(U) N - (U)n-i = H(Un - f/jv-i) 2 |^_i] = 4( E te k £ N , k (Uy 



k=l 
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W.l.o.g. we can assume J2f =2 Sj > 0. Otherwise follows $3j_ 2 $j = which implies <5j = 
for all j = 2, . . . ,N and thus (1% = £f =2 ( (V)j-{U)j-i ) = °- ThcnP((C/) Ar > Q) = 



N N c2 

p((c/) N > q) = p(£ (([/). - (u).^) > q) <J2 ¥ (( u h Ms-i > ' 



N 

3=2 ' j=2 £-ik=2 u k 



l^k=2 u k 



N 3-1 

E p (n*ii"E' 

3=2 k=i 

2 srj-l i2t 



To apply inequality (4.4) we estimate ||<5|| 2 2 £Q ^Cj,fc(^) 2 < c§A*||£||f a l7* ra+8 < 
c 2 £ 4(s-i)/(2s+2Ta+i) analogous to (6.4) and obtain 

P((C/) JV > Q) < CiJVexp(- ^Q e -*(.-i)/(2-+2ra+i)^ 
We deduce from Bernstein's inequality (6.3) 
^36 



K 2 



<2P( > Q) + 2P( fc max w |£* - > S) + 2exp ( + ^ 

<2CiiVexp(- %Qe 



( ■_! -4(s-l)/(2s+2Ta+l) 

+ 4C 1 7Vexp f - J^_ 5e -2(*-i)/(2 a +2Ta + i)\ + 2 / _ _) , 

1 V V cf-Vl / F V 144(36g + nS) J 

By choosing Q = k5 and S* = 1 /^ e ( s - l )/( 2s + 2T "+ 1 ) we get 

> ^) < (6C1JV + 2) exp ( - c 8 min ( K i/2 £ -(-D/(2 S +2T fi+ i)^ _ 

For all e < we have K£ -2( S -i)/(2 S +2Ta+i) > K ( £ Wy 2{s - 1)/{2s+2T&+1) „ 1 and 
hence, 

P3 < HD N >±)+P(u N >±)< (7C.N + 2) exp ( - ^^-(-D/^Ta+i)) 

Putting the bounds of P\,Pi and P3 together yields for a constant c g (0, 00) and all 
e < e A 1 with e := minjeW, e^, e^} 

P(|fio - «| > «) < (7^ TV + Ci + 2) exp ( - c( K 2 A K i/ 2 ) £ -(-i)/(2 S +2T a +i)^ . 
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Appendix A: Proof of Lemma 2.1 

Part i) The martingale condition yields 

/ [°° e x k(x) \ 

\<Pt(u — i)\ — exp yT I ( cos(ux) — l) — — — dx J . 

^ J — oo \ x \ ' 

W.l.o.g. we assume T = 1, a > and u > 1 because of the symmetry of the cosine. 
We split the integral domain into three parts: 

\ Vl (u - i)\ = exp J\ J\ £ ) COS ^~ V /"fc(f) + e -»/»*(-£ )) dx) . (A.l) 

Using Halloo < ||fc||TV < oo by assumption and the constant C\ := f Q 1 ~ c ° s x dx 6 (0, oo), 
we estimate 

1 cosg-l ^ a/ _ g/ ^ dx > 2e i/u mx f 1 cos x ^ ^ „ 2Cie || fc || oo ^ 

X J Q X 

In the second part the dependence on u comes into play. Writing k(x) :— k(x) + k(—x), 
the Taylor series of the exponential function together with dominated convergence yield 

cosx- I fcx/^gj + e -/«fc(-2)) dx 

oo rU 



/ COS X 1 / X 

a." dx + 11*11°° E y ( cos * ~ dx 

A;— 1 

>-alog(u)+ f 1 ( a -k(x))— + f ^Mfc(x)d 2 ;-2||fc|| 00 Vi-(l- U - fe )) 

Jl/w ^ Jl/u x k = 1 K-K 

i / \ k(x) — a f 1 cos(ux) ~ 
> — alog(it) — sup 2e||K|| 00 + / k(x)ax. 

a;e(0,l] x Jl/u x 

To bound the last term in the above display, we proceed as [20, Lem. 53.9]. By the 
bounded variation of k we can define a bounded signed measure p via p((a, b]) — k(b+) — 
k(a+), < a < b. Noting that da; can be bounded uniformly y € [1, oo) with a 

constant C2 > 0, Fubini's theorem yields 

1 cos(wxO r . . [ u co&x, [ u cosx f x/u . , 
k(x)dx = / dx+ / / p{dy)dx 



l/u x J I x Jl x Jl/u 

Jl x Jl/uJuy x 

> k(i+)mm f C —dx-2C 2 [ \p\(dy) > -2C 3 II - 4C 2 \\k\\ Tv . 



v>l /-, X 
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Obtaining for the third part in (A.l) /" cos(ux ) ~ 1 (e x k(x)+e- x k(x)) dx > -2\\e x k(x)\\ L i , 
we have with q k as defined in Lemma 2.1 

Wi(u -i)\> exp (-q k - (2e + AC 2 + 2C 3 )||fc||rv ~ 2\\e x k(x)\\ L i) U - a . 

We deduce the estimate \lpt(u — i)\ > C V (T, q k , \\e x k(x)\\Li 7 ||fc||ry }) M -Tct for |u| > 1 
with C V (T,R) :=exp(-Ti?(3 + 2e + 4C2 + 2C 3 )). 

Part ii) follows immediately from the explicit choice of C v . 
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Germany. E-mail: trabs@math.hu-berlin.de 

This article is supplement to "Calibration of self-decomposable Levy models" (Trabs 
[5]). In Section 1, we derive the representation of the characteristic exponent stated in 
Proposition 2.2 of [5]. Moreover, we discuss Le Cam's asymptotic equivalence of the 
nonparametric regression considered in [5] to the continuous-time white noise model in 
Section 2. Therefore, we can show uniform lower bounds for the risks of the estimators 
derived in [5] in the latter model. All proofs are given in Sections 3 and 4. 

1. Representation of the characteristic exponent 

Recall that we consider a pure-jump Levy processes X t with characteristic triplet (0, 7, v) 
where 7 G R the jump measure satisfies v{ Ax) = k(x)/\x\, x G R \ {0} for some bounded 
variation function fc : R — > R + [cf. 5, property (K)]. Furthermore, we suppose that X t 
fulfills 

Assumption 2. Assume k G C s (R\{0}) with all derivatives having a finite right- and 
left-hand limit at zero and (1 V e x )k(x), . . . , (1 V e x )k ( - s '>{x) G L 1 (M). 

Using the Levy-Khintchine representation and assuming the martingale condition = 
T + J™oo( eX ~ dx > we obtain 

ip(u) := ^ log(ipT(u — i)) = i^/u + 7 + / i(u — i)T(sgn(x)k(x))((u — i)t) At. (1.1) 

Jo 

Adding a linear combination of the functions hj(x) :— x^ e~ x l[ oo - j (x), x G R, j G NU{0}, 
we can compensate the discontinuities of x i-> sgn(x)k(x) since for j > 1 it holds hj G 
C J_1 (R) and all hj are contained in C°° (R \ {0}) . Hence, we can find atj, j = 0, . . . , s — 2, 
such that 

s-2 

g(x) :=sgn{x)k(x)-^2a j h j (x) G C^ 2 (R) n C S (R \ {0}). 

1 



2 



M. Trabs 



These coefficients are given recursively by the following formula, which can be proved by 
straight forward calculations. We omit the details. 

Lemma 1. On Assumption 2 the factors ctj,j = 0, ... s — 2, satisfying g G C S_2 (R) (1 
C S (K\ {0}), can be calculated via 



1 J i 1 \m 

Xj = ( fc (i)(o+) + fc w)(o-)) - £ K -^r^-m- 

■'' m=l 



Hence, the Fourier transform in (1.1) can be written as .F(sgn -k){z) = jFg{z) + 
Sj=o a j-Fhj( z )i where integration by parts yields 

f°° i I 

ThAv-ti)= e <Ki(i-t))v'd 2; = - ■>■ «eM,te[0,l), 

and |.F(7(it)| decreases as |u| _s because of the smoothness of g. This leads to the following 

Proposition 2. Let s > 2. On Assumption 2 there exist functions D : { — 1,1} — > C 
and p : R \ {0} — > C such that is bounded in u and it holds 

^(ti) = J>(sgn( M )) + iTu - a log(M) + V) U ~ h j + p(u), u £ 0, (1.2) 
From the proof in Section 3 we deduce the form of the mapping D : {—1,1} — > C: 
£>(±1) = 7 T i\ ~ 5>' ~ + | ^(x) 6 ^ 1 dx ± i 1 -F^ 2 ^))^) d«. 



o 



2. White noise model and lower bounds 

To establish asymptotic lower bounds for the convergence rates of the estimators 7, dj, 
j = 0, . . . , s — 2 and fc e , we consider the continuous white noise model 

dZ v (x) = O v (x)dx + -=X N (x)dW{x), x G [—An, An], (2.1) 
v ./V 

with a two sided Brownian motion W, an option function O-p induced by the pair V G 
5 S (i?, a) and A^r > growing in N. Under certain conditions, we can apply the results 
of Brown and Low [2] to show the asymptotic equivalence of the regression model 

J - = 0(x J )+* i e J) j = l,...,N, (2.2) 

considered in [5] and the white noise model (2.1) for N — !• 00. Setting N and e in relation 
to each other allows us to derive lower bounds in terms of e. 
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To that end, we state the situation of (2.2) more precisely. Let N £ N and Hm : 
[— Aw,Aff] [0, 1] be an increasing, absolutely continuous distribution function with 

h N := H' N > a.c. on [-A N ,A N ] (2.3) 

such that the strikes are given by Xj — Hj^^j /(N + 1)) for j = 1, . . . ,N. Furthermore, 
let Sj = S(xj) for some function 5 £ L°°(R). We suppose that S is absolutely continuous 
satisfying the technical condition 

-^log5(x)\<C s , x£R, (2.4) 

for some constant Cg < oo. From Bclomestny and ReiB [1, Prop. 2.1] we know that 
O' is continuous on 1\ {0} with a jump at zero of height -1. ||0"||li < 3 implies 
l^'fa)! — I fa 0"{x) dx| < 3 for all x ^ and thus, O 1 is uniformly bounded. Especially, 
O is Lipschitz continuous for any V £ Q S (R, a)WH s (R, a). The equivalence result follows 
immediately from Brown and Low [2, Thm. 4.1, Cor. 4.2]. 

Proposition 3. Under the assumptions (2.3) and (2.4) the nonparametric regression 
model (2.2) with Sj ~ N(0, 1) iid. and the white noise model (2.1) are asymptotically 
equivalent in Le Cam's sense if X 2 N (x) = 8 2 (x) /hjq{x) for x £ [— An, An] . 

Remark 4. Grama and Nussbaum [3] showed the asymptotic equivalence of location 
type regression models with non-Gaussian noise to regression models with Gaussian noise. 
Hence, the condition on the distribution of £j in the preceding proposition can be relaxed 
to error laws with regular densities. 

By Proposition 3 asymptotic lower bounds in the regression model can be proved in 
the white noise setting equivalently. To this end we need a uniform lower bound of the 
noise level Xn/V^N in terms of e. Let the strike distribution be polynomial: 

h N {x)~Cj?(\x\ + l)-*, x£[-A N ,A N ], (2.5) 

for some q > and normalization constant Cm- This is reasonable since in practice 
most of the traded options are almost at-the-money whereas only less ones are far in- or 
out-of-the-money. Moreover, we suppose a minimal noise through 

S\x)>\\S\\ 2 00 (M + ^- p , xel, (2.6) 

where p > 1 is necessary. The restriction on p is because the condition A||(<Jj)||? 2 < 
||(<£j)lljL in the Theorem 3.2 in [5] implies necessarily 6 £ L 2 (R) which is due to Fatou's 
lemma: 

N 

\\S\\l 2 < liminf (a^) +J>, -x^S 2 ^)) < liminf A||(^)||f 2 < U^. 

In the same way A||(Zj£j)||? 2 ^ 11(^)11^°° implies even xS(x) £ L 2 (R) in the situation 
of Theorem 3.4 in [5]. In view of the condition e~ AN < A 2 ~ e 4 and Ajye~ 
respectively, we assume addionally An ~ loge" 1 . 
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Lemma 5. Let the properties (2.5) and (2.6) be satisfied. If An ~ loge 1 holds then 
we obtain Ajx ^ > e 2 (\oge~ 1 )~ 13 for f3 := max{p, q}. 

The proof of the lemma is straight forward and thus omitted. In the sequel we assume 
always that the model (2.1) satisfies the conditions of this lemma. 

Remark 6. The proof of the convergence rates of the estimators 7 and ctj [5, Thm. 
3.2] reveals that the condition A||(5|| 2 2 < ||^|| 2 °o i s only used to estimate the remainder 
term. In the case s > 3 our bound is not strict and we can replace the constraint by the 
weaker one 

& r \\6\\% < ¥\\L 2r for some r e ( 1, Zs + 2T& - 1 ' 
II IIP ~ II Ni« ^ , 2s + 2Ta + l. 

In this setting Sj can be bounded away from if A increases slowly enough whereas for 
r = 1 the noise 6j must tend to for Xj —> ±00. Otherwise A||5||? 2 could not be bounded 
because of AN > — > 00. Therefore, we can consider the case p € [0, 1) in Lemma 5. 
In the situation of = p = q we obtain (3 = and hence, the lower bounds proved in the 
next theorems imply that our estimation procedure achieves exact minimax rates for the 
parameters. With regard to k e the condition A|| (:Ej<5j)|| 2 2 ^ ll(^i)ll?<» yields j3 > p > 3. 

Theorem 7. Let s G N, s > 2,R,a > and j — 0,...,s — 2,. We obtain in the 
observation model (2.1) the asymptotic risk lower bounds 

inf sup E P [|7- 7 | 2 ] 1 /2> ( £ (l 0g£ - 1 )-' 3 / 2 ) 2s/(2s+2Ta+1) , 

7 V£G B {R,a) 

■ e r, n* |2ll/2 \ I n — 1 X _fl/2N 2(s — 1— j)/(2s+2Ta+l) , 

ml sup E-p[\aj — aj\\' > [e(\oge ) Pl ) and 

& i veQ 3 (R,a) 

inf sup Ep[||jfe e - fcelH,] 1 / 2 > ( £ (log £ - 1 )-' 3 / 2 ) 2s/(2s+2Ta+5) 

where the infimum is taken over all estimators, i.e. all measurable functions of the ob- 
servation Z . The bound for k e holds for s = 1 as well. 

As noticed these lower bounds are a logarithmic factor better then the convergence 
rates in the last section in the case /3 > 0. The following corollary extends the result to 
the parameter a s -i := fe' s_1 )(0+) + k( 8 ~^(0— ). Especially, the estimation of a in the 
case s = 1 is of interest. 



Corollary 8. Consider the situation of the Theorem 7 where s = 1 is also possible. For 
a s _i we obtain asymptotically the risk lower bound 

inf sup Ep[|oi s _i - a s ^\ 2 } 1/2 > 1 

in the observation model (2.1). Hence, we cannot estimate a s ~i consistently in the L 2 - 
sense. 



3. Proof of the characteristic representation 

Standard Fourier analysis yields the decay of \J-g(u)\: 

Lemma 9. Let f £ C S - 2 {R) H C S (R \ {0}) for s > 2 and f G C 1 (M\ {0}) in case o/ 
s = 1, respectively. Furthermore, we assume finite left- and right-hand limits of /' s_1 ) 
and at zero and / (0) , . . . , / (s ) G L 1 (M) . TTiera we ob£am 

|7-/(u)|<H- s /or |u|->oo. 

Especially, on Assumption 2 there is a constant C g > independent from t and u such 
that 

\Tg{v -it)\< C g \v\- S for t G [0, 1) and |v| > i. 

Proof. Part 1: Since / G C S ~ 2 (IR) has a piecewise continuous (s — l)th derivative and 
all derivatives are in L 1 (M), standard Fourier analysis yields 

Hf^-^^i-iuy-^fiu). 

Therefore, it is enough to show for / G C 1 (M\ {0}) with /, /' G L 1 (R) that < 
C|u| _1 , |u| > 1, where C > does not depend on u. The integrability of /' ensures the 
existence of the limits of / for x — > ±oo. Since / itself is absolutely integrable, those limits 
equal 0. Integration by parts applied to the piecewise C^-function verifies for u ^ 0: 



e mx f(x)dx + / e iux f(x)dx 



< 



(l/(o- 



/(o+)| + ||/'IUO- 



Part 2: From Part 1 and the Leibniz rule follow for t £ [0, 1) and \v\ > ~ 



|^(«-tt)| 



7 (s-i) 



9 s 



(0+)-^(^ ff (x)]( V ) 



7 («-i) 



(o- 



,0-i) 



2=0 



Hence, it remains to bound | T (e tx g^ (x)) (v) \ uniformly over t £ [0, 1) and \v\ > |, where 
I = 0, . . . , s. For each j = 0, ... s — 2 and £ = 0, . . . s there is a linear combination (x) — 
ELo Pm l) h m (x) with 6t,m = 0,...,j. Thus, we can find £ K, j = 0, . . . s-2, 
such that the derivatives of g are given by g^(x) = sgn(x)k^ (x) +X)?=o a oPj ' l ii x ) f° r 
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x £ K \ {0}, I — 0, ... ,s. Therefore, we obtain for all t £ [0, 1), \v\ > \ and I = 0, . . . , s 

s-2 

F(e tx g^\x)) (v)\ = \f (sgn(x)e tx k^ (x)) (v) + £ arff T {e^h^x)) (v) 

3=0 

< ||(1 V e*)k®(x)\\ Ll + J2 h_^|- +1 < 11(1 V e x )kV{x)\\» + ]T 2^ +1 )j!|a^f|. □ 

With this lemma at hand the representation (1.2) can be proved as follows: Owing to 
the symmetry ip(—u) = ip{ u )i u € R, it is sufficient to consider the case u > 0. To develop 
this integral further we consider for re (0, i) 

,1-T 

£ T (it) := / i(u — i)F(sgrx-k) {{u — i)t) dt 
Jo 

S-2 r l- T r l-T 

i(u — i)J-g((u — i)t) dt 



/ i(u — i)a.j\Fhj((u — i)t) dt + / 
„'_n Jo Jo 



3=0 

s-2 



£(j - l)!ay - a Iog(r - - r)) + £ r^j^ 

3=1 3 = 1 V V " 

+ / i(u — ijJ^gi^u — i)t) dt 



To calculate the last integral we split its domain in [0, |] and (i, 1 — r]. By assumption 
and choice of ^ we obtain le^-^^o;)! < |(1 V e a; / 2 )g(a;)| € L 1 , for < t < \, and 
thus, we can apply Fubini's theorem to the first part: 

1/2 roo rl/2 

i(u-i)Fg((u-i)t)dt= / g(x) / i(u - i)e l{u ~ l)tx dtdx. 



Since z \— > e lzx is holomorphic, Cauchy's integral theorem yields 

1/2 r (u-i)/2 [■- i/2 Au-i)/2 



i(u - i)e l{u - l)tx dt = / ie izx dz = 



JO JO J -i/2 



le az + / le 



Hence, 



1/2 /.oo / /.1/2 nu/2 \ 

i{u-i)J : g{{u-i)t)dt = / g{x)\ / e te di+ / !e* ra+I / 2 dti di. 







Another application of Fubini's theorem to the second term shows 

,1/2 

i(u — i)J-g((u — i)t) dt 



e x ' 2 - 1 



g(x) dx + i I F{e x,2 g{x))(v)dv-i / F{e x/2 g{x)){v) dv. (3.1) 

Ju/2 



7 



The first two summands are independent from u whereas we can use Lemma 9 to estimate 
the last integral for u > 1: 



u/2 



T(e x/2 g(x))(v)dv 



<C„ 



\- s dv= ^|u|- +1 . 



u/2 



(3.2) 



Also the integral over (|, 1 — r] can be estimated using Lemma 9. For all t £ (0, |) and 
for all u > 1 we obtain uniformly: 



i(u - i)Tg({u - i)t) dt < C g \{u - i)u" s \ t- s dt~\ u \ 



'1/2 

Thus, (3.1) yields 



-s+l 



(3.3) 



1/2 



s— 2 _ c 
Cr(«) = - " 1)'«J + / 



e x/2 _ i 

g(x) dx + i I j u (e x/2 g(x))(v)dv 



ao log(T — iu 



o 



s-2 



(j - 



' (r — iu(l — T)y 

J=1 



with 



p T {u) 



poo p1—t 

-i T(e x/2 g(x))(v)dv + i(u - i)Fg{{u - i)t) dt 

Ju/2 Jl/2 

OO pi — T 



(3.4) 



(3.5) 



i F{e x ' 2 g{x)){v)dv + i(u - i) (j"(sgn-fc)((u - i)t) 

Ju/2 Jl/2 ^ 



s-2 



j\ aj 



3=0 



(1 - i(u - 



dt. 



-s + l 



Plugging the estimates (3.2) and (3.3) into equation (3.5), we obtain |p T (u)| < \u\ 
uniformly over r > and u > 1. 

For u > there exists p(u) := lim T _ s .o (w) because .F(sgn-fc) is defined on {z £ 
C|Im(z) £ [—1,0]} and is continuous on its domain whereas the integral over the sum 
can be computed explicitly. Then the bound |it| _;i+1 holds for p(u), \u\ > 1, too. Also for 
small u £ (0, 1) the term |u s ~ 1 p(u)| remains bounded since p has a pole at of maximal 
order s — 2. Since all terms in (3.4) are continuous in r at this equation is true for 
t = 0. Finally, we notice log(— iu) = log(| — iu\) + i arg(-iu) = log(|u|) — iir/2 and insert 
(3.4) in (1.1). 



4. Proofs of the lower bound 



First, we are interested in the distance of Z-p and Z-p 1 with Vq^Vi £ Go(R, a). Girsanvo's 
theorem implies the equivalence of the laws of Z-p t and the likelihood ratio for Vq with 
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respect to V\, given by Liptser and Shiryaev [4, Theorem 7.18], is: 

A(V ,Vi)=exp( / (O Vl -O P0 )(x)VNX N (x)- l dW(x) 

• >-A N 

\O Vl -O V0 \ 2 (x)NX N (x)- 2 dx] 



Hence, we can bound the Kullback-Leibler divergence using the uniform bound of N/\ 2 Nl 
Parseval's identity and the pricing formula 

KL(V 1 \V )=E Vl [log(A(V 1 ,r ))] = - / \O Vl -O V0 \ 2 (x)NX N (x)- 2 dx 

poo 



e- z (loge- L r / \F{O Vi -O V0 )(u)\ z &u 



-OO 

~e- 2 (log £ -y r 'd- (4.1) 

J-oz u(u - i) 

4.1. Lower bound for 7: 

Following the standard approach, we perturb a pair Vq G Q s {R 1 a). Let Vq = (7o ; &o) 
satisfies all conditions where norms and constants are strictly smaller then R and with 
a — a. Furthermore, let fco do not decrease too rapidly, i.e., we assume ko > \x\~ p and 
l^ol ~ M~ 9 f° r some p, q > 0, and |v?t(u — i)| < |u|~ Tq hold exactly. Certainly such a 
pair exists. 

Let S > and consider T^i = (71 , fci ) given by 



7i=7o + <J and JF\ — , , — —e )(u) = —5i(u-i)e 
V a; / 



where we will choose U > 1 and to € N properly. In the following we call the difference of 

|*| 



the exponentially scaled jump measures g(x) := ^°^ e x . By construction g is real 



valued and the martingale condition is valid: 

7i + / (e x - 1) ^ dx = 7o + 6 + { (e* - 1) ^ dz + ^(0) - 
Jr Fl Jr Fl 

= 70 + ^+ / (e--l) fc °Mdx-5 = 0. 
Jr Fl 

Also the moment assumption can be checked straight forward for 6 small enough: 
E Vl [e 2XT ] = E Vo [e 2XT ] • exp (26 + f (e x - e' x )g{x) dx) 

J — OO 

= E Vo [e 2XT ] ■ exp (25(1 - e (-D m+1 /^ 2m )) < e 2S E Vo [e 2XT ] < R. 



9 

the 



Using the Schwartz-functions (1 := T 1 ( — iue " 2m ),C2 T 1 { — e " 2m ) € 
inversion and scaling properties of the Fourier transform yield for ueR 

7$(u) = SUHi(^) + SFU^j) = ^{SU'C^Ux) + 8U( 2 (Ux))(u). 

Hence, we have g(x) = 5U 2 (i{Ux) + SU&iUx) G S*(R). Even e~ x g(x) G S*(R) holds 
because of J : (e~ x (j(x)) (u) — F(j(u + i) for u e M and j G {1, 2}. 

If 5U 2 < 1 the disturbance \x\e~ x g(x) and its derivative are bounded for all U > 1, owing 
to the rescaling with [/: 

|[/a;e-%-(C/x)| < |Ua;(l V e -l7: %-(?7a;)| < \\x(l V e^C^x))^ < oo. 

Since additionally larle^p^) is fast decreasing and fco and k are bounded from below, 
the k-function k\ is non-negative and satisfies the monotonicity conditions provided 6 is 
small enough. 

The continuity and polynomial decrease of Q imply ||(1 V e~ x )Q Wl 1 < = 1>2. 

Furthermore, by construction (ki — k )(0) — and the derivatives of the disturbance is 
given by 



(fci-to)< m >(aO 



dx r < 



1=0 



e -^(x) = ^(™)(-l) i (|x|-?sgnx) e -V m - (^) 



for m = l,...,s and a; ^ 0. Hence, (fc - fci) (m) (0+) + (fc - fci) (m) (0-) = for m 
0, . . . , s and substituting y = Ux yield 



||(1 V e*)(fcd - fe 1 )t->(a?)|| £1 

S 

<s(y'\\(ive-")\y\G(y)\\ +Y1 

(iv e -^)MC 2 s (y)|l +E 



U s 



(ive-y)(\y\ + i)cr l (y) 
(ive-y)(\ y \ + i)cr l (y) 



L 1 



L 1 



and even better bounds for derivatives of lower order. Thus, the norm restrictions are 
fulfilled by choosing U ~ S" 1 ^. Additionally, 5U 2 ~ (J^- 2 )/" < 1 is valid. Therefore, 
Pi G G S {R> ctmax) holds. From Tsybakov [6, Thm. 2.2] follows the lower bound 

inf sup E P [|7- 7 | 2 ] > S 2 , 
*> PeS s (R,a max ) 

once we have shown that the Kullback-Leibler divergence is asymptotically bounded. We 
deduce from equation (4.1), the estimate |1 — e z \ < 2\z\ for all z G C in a small ball 
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around and the assumed decrease \ fr{u — i)\ < \u\ Ta for m > Ta + 1 
KL(V!\V ) < / |^„(«-*)| 2 N«5( W - J )+J- 5 (u)-^(i)| 2 ( M 4 + u 2 )- 1 d u 



e-^loge" 1 )^ 2 / \^ Vo (u~i)\ 2 \l~e- u2m / u2m \ 2 u- 2 du. 



\- 2Ta - 2 \l-e- u2m \ 2 du 



< epilogs- 1 fS 2 ^ 2 ^- 1 f 
<s- 2 (loge~ 1 fS ( ^ 2s+2Ta+1 ^ s 
Hence, KL{V X \V ) remains bounded if 8 ~ (e 2 (log£- 1 )-^) s/(2s+2T " +1) . 

4.2. Lower bound for a.,-: 

We will need the following auxiliary lemma which is shown in Section 4.4 separately: 

Lemma 10. Let j G {0, . . . , s— 1} andm G N. There is a family of functions (gu)u>i C 
C* s (M\{0}) or (gu)u>i C C^ 1 ( K )nC s (M\{0}) /or j = or j > 1, respectively, such that 
each gu has compact support and satisfies the following conditions for some constants 
S, U min > 1 and for all U > U min : 

in) gu (0+) + gu (0-) > 0, , 9 [/' ) (0+) + gjp(0-) = 1 and |<$(0+) + ^(0-)| < S for 
l = 0,...,s-l, 



Se/||°° < S and (1 V e x )|<?y- (a;)| dx < S* /or Z = 0, . .. , s as weZZ as 

^J-l i^r^"^^)^ <s, u g [-1,1]. 



Now, we are in position to prove lower bounds for ay. Since the convergence rates of 
ay decrease for rising j and because of the recursion formula in Lemma 1 , it is sufficient 
to consider fcW(0+) + fcW(O-) instead of a y Fix a j G {0, . . . , s - 2}. 
We argue analogously to the proof for the estimation of 7: Again we perturb a pair 
T^o = (70;&o) G Qs{R,ot) with exactly the same properties as above. To disturb Vq in 
a suitable way, we choose a family of functions (gu)u>i C C S (IR \ {0}) or (gu)u>i C 
C^' -1 ^) nC s (l\ {0}) for j = or j > 1, respectively, with the properties i) - v) from 
Lemma 10 for some constants S, U m i n > 1 and m G N, m > Ta + 2. For 6 > define 
^1 = (7i>fci) as 

7i : = 7o and k\(x) := k$(x) — 5U~ 3 gu{Ux), x G R. 
From i) follows the martingale condition as for 7: 

= 7o+ / ^-lj^Mds-W^' / eZ ^~ 1 gaWda ; = 7 1 + / (e x - 1) dx. 
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As long as SU^^ 1 is bounded the perturbation and its derivative are bounded in U 
such that the necessary monotonicity and non-negativity conditions of k\ follow as in the 
proof before. We derive the moment assumption using ii) 



2 A' T 



exp 



(-6U- 



J2x 



- gu (Ux)dx)E Va [e 2XT } < e ssu ~ 3 E VQ [e 



2..\ 3 



Furthermore, the smoothness of gu and condition iii) yield 

(fc - fci)W>(0+) + (fc - *i) W) (0-) = 5 and 
(Jfco - fci) (s - 1) (0+) + (k Q - fci)^ _1 >(0-) < SSU'- 1 -*. 
Using the integrability condition iv), we estimate 

||(1 V e*)(ki - fco)«|Ui = <SCr^'||(l V e x )( 9u (Ux))^\\ L1 

= SU'-^Wil V e x / u )g { j\x)\\ L i < SSU'' 1 ^ 

and better bounds for derivatives of lower order. Thus, V\ £ Gs(R,ce max ) if we choose 
U ~ £-V(»-i-j) with a constant small enough (note <5L/^' +1 - ,y(*- 2 )/(»-l-i) < 1). 

With respect to condition v), the uniform boundedness of gu, their compact support 
and the estimate (4.1) the Kullback-Leibler divergence is bounded by 



a I log e\P <-°° 



<\}ogsl s2u - 2j 



< 



log el' 9 



\<Pt,v {u - i)\ 2 ^ 

\PT,v (u - i)\ 2 



-e x (ki-ko){x)dx (u 4 + u 2 ) _1 du 



5 2 U- 2j 



— OO 

oo 



-e x gu (Ux) dx 2 (u 4 + u 2 )- 1 du 



\-2Ta-A 



oo ^iux/U y 



* /u gu(x)dx 



du 



<- 1 logg]^ g2jj-2Ta-2j-3 



-2Ta-4 



-e x/u gu (x) dx 



du 



2 (log e:~ 1 )' 9 (5 (2s+2 ' rQ+1)/(s_1_j) . 



Hence, the choice S ~ (£ 2 (log£ -1 ) - ^) 1 ^^ 2s+2Ta+1 ^ yields the claim. 

Form this proof we can conclude Corollary 8 as follows: 
Using the same perturbation Pi of a pair Vq G Q s (R,a) we obtain bounds for (fcg — 
fcl) (s_1) (0+) + (fco - fci) (s_1) (0-) and ||(1 V e x )(h - fe ) W IUi which depend only on 5. 
Thus, we choose U > 1 and S independently from each other and estimate the Kullback- 
Leibler-distance as in the theorem: 

KL{Vi\V Q ) < e-*QogE- 1 )WU-*- 2Ta - 1 . 

Therefore, for a small constant S and U ~ (e~ 2 (loge~ 1 )^) 1 ^ 2s+2Ta+1 ' ) the Kullback- 
Leibler-divergence is bounded. 
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4.3. Lower bound for k e : 

Choose some Vq = (70,^0) € H s (R,a) such that the corresponding characteristic func- 
tion decreases as |u| _Tq but all integral norms and constants are strictly smaller than 
R. For j e N let ipj e C°°(K) satisfy supp^ = [0, 1] and ||^j||i,2 = 1 as well as 

/ ipj(x)dx= / ipj(x)e~ 2 Jx dx = and / \J 7 ipj(u)u~~ x \ 2 du < 00 

for some A > Ta + 2 and have uniformly bounded norms HV'j-"^ \\l 2 < C for a constant 
C > and all j > 1, m = 0, . . . , s. Such functions exist, for instance the last property 
holds if ipj is the Ath derivative of an L 2 -function. Defining 

il>jl{x) := 2 j / 2 ipj(2 j x - I) for j >l,l = 2 j ~ 1 , 2 j ~ 1 - 1, 

we obtain JY>jz (0) = Fijjj^i) = 0. For any r = (r 2 j-i , . . . , r 2 j-i) € { — 1,1} 2J 1 we 
consider the perturbed pair V r = (70 > fcr) with 

2 3 -l 

— fco(a;) = <5e _a: |a;| ^ riipji(x), 

Hence, £; e , r (x) — fc e! o(a;) = feX^-y- 1 r i' l Pji( x ) an d satisfies the martingale condition 
and (k r - fc o )(0+) + (k r - fc )(0-) = 0. Assumption 1 with C 2 < R and E Vr [X T e XT ] = 
—iLp' TVr (—i) < R hold for 8 sufficiently small. Using the disjoint support of ipji for 
different I, we calculate for m = 0, . . . , s: 



2 J -1 .. , m n 23-1 .. 2 



\\k™-k<$\\h=# E \\-^a^fl{x) l2 =S 2 Y: \\x^\x)+m^- 1 \ 
=(5 2 2 2 jm ^ ||2-^(y + /)^ m) (y) + m2-Vi ro - 1) (2/) 



L 2 



2 

L 2 ' 



We note that y — 2^x — I e [0, 1] implies .t = 2 + /) e [1/2, 1] and estimate 

ll*^ " fce^ll! 2 < 5 2 2 2 ^+^ 1 (2||^ m) ||l 2 + 2m 2 ||^ (m - 1) || 2 2 ) < 5 2 2^ 2m+1 ). 

Since ||fc e>r — fc^olU 1 ^ <52 J / 2 follows in the same way, choosing 8 <~ 2~ : '( ;s+1 / 2 ' ensures 
■P r e U s {R,a). 

Let r, r' e { — 1, 1} 2J with Hemming distance equal to one, that is r; = r ; ' except for 
one l . Then, ipjj (x) = for \x\ < 1/2 implies 

\\Kr - K r ,\\ 2 L2 = 48 2 \\x^ lo (x)\\ 2 L2 = 48 2 \\2-i(y + lo)^(y)\\ 2 L 2>8 2 \\^\\ 2 L2 . 
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We will apply Assouad's lemma (see Tsybakov [6, Lem. 2.12]), which yields 
inf sup E v [\k e -k e \ 2 ]>V- 1 \\k^ r -k e .A 2 L2T >2^ s 

fee VeU s {R,a) 

if the Kullback-Leibler divergence for two alternatives with Hamming distance one re- 
mains bounded. This holds true by choosing 2~ J ~ (e(log £ - 1 )-^/ 2 ) 2 ^ 2s+2Tq!+5 ^ smcc 

KLiVAV^Se-^Xoge-y f°° <p TVr {u - i)? ^^' 7 K)[x \ u) V + u 2 )- 1 du 

./_™ V \x\ ' 



-2Ta-4 



Tipji (u)\ du 



/oo 
-oc 

/oo 
\u\- 2T& - i \^rjj j (2- j u)\ 2 du 
-OO 

< £ - 2 (loge- 1 )^2-^ 2s+2r »+ 5 ). 



4.4. Proof of Lemma 10 

Let j > and define C~ 1 (E) := R K . For a constant T > 1 consider the set 

■= {/ € C s (M\{0}) n C J ' _1 (R)| supp/ = [-1,1],/ satisfies condition iii), 

||/ (i) ||oo <T,l = 0,...,s} CL 2 (R). 

Certainly, ^p is non-empty if T is greater then a minimal value. Thus, each g E 
satisfies the properties ii) - iv) for some S > T big enough. To handle conditions i) and 



v), we define the functions Xi '■ [~ 1> 1] 



M- sgn(a;)a;' 1 e x / u , with / = 1, . . . , m — 1, 



and consider gu £ which satisfies gu _L xY J = 1, • . • , m — 1, where we write / _L g 
if f-,9 € L 2 [—l, 1] are orthogonal in the sense of L [— 1, 1]. Then dominated convergence 
yields for u € [—1, 1] 



-<? /u 9u 



, . I ^ | u |fe fOO 



k-l x/U 



gu(x) dx 



<«■ 



E 



e^lstKaOldx < |w| m / eN (l Ve 1 ) \ 9u {x) \dx < 2e 2 T\u\ 



Hence, condition v) is satisfied if S > 2e 2 T. Furthermore, the orthogonality of gu and 
xYl = 1, ■ • ■ , m — 1, implies 



,9u 



E 



sgn(x)x k x/u 
k\{-U) k 



9u 
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Therefore, gu satisfies condition i) if gjj _L Xm- ^ remains to prove 

n sty ^ where si v := {xf\l = 1, . . . , m}. 
This can be done by contradiction using a separation argument: 

Assume n ^j" = Since 7^ and ,2/^ are convex, the Hahn-Banach separation 
theorem and the Frechet-Riesz representation theorem imply the existence of a function 
& in the Hilbert space (L 2 [-l, 1], (■, ■)) such that (/,&/) < (g,£u) for all / G ^ T ,Vg G 
£?u- Because si-y C L 2 [— 1, 1] is a subspace and the linear functional sdfj 9j4 (g, £u) G 
R is bounded from below, (g, £y) = holds for all g G . Since stf\j is finite, we conclude 
further 

iu G UnM/ = IinM/ and (/, < 0, V/ G 
This leads to a contradiction if there exists an / G such that (/,£[/) > 0. To show 
this, we define xi G £ 2 [— 1, 1] via the pointwise limits Xl( x ) '■= urn (7->-oo xY ( x ) f° r x € K\ 
{0}, / = 1, . . . , m. These limits are linearly independent and non-zero. By the compactness 
of the interval and the uniform boundedness of the functions xY this is also an L 2 limit. 
Let £[/ = a ixY fo r some ot/ G R, Z = 1, . . . , 771. Using the Cauchy-Schwarz inequality, 

we obtain for all / G &t'- 

m m 

</, Cc/> =E fl '( (/• »> - </> X/ - xF) ) > E N ( (/' ^Mxi) ) - ll/IU" ll» - xFlM 
1=1 i=i 

> E M«/, Bgn(o,) X i) ) ~ V2T\\ Xl - xY\W) 
2=1 

Thus, for some r > we choose T big enough such that for all e = (e;)J™ 1 G { — 1, l} m 
there exists an / e G such that min; = i. .. im (/ e , eixi) > t. (This can be done by choosing 
/ e as a polynomial with 2s + 2 conditions at ±1 and as well as m linear restrictions on 
the coefficients of the polynomial since we can calculate (f e ,eiXi) explicitly.) Choosing 
a Umin > 1 such that r > v / 27 1 max /= i i ... im ||xj' m ' n - X/||l 2 ensures (f e ,€u) > with 
e = (Bga(ai))YL v 
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